On representation of fundamental frequency of speech for prosody analysis using reliability function

نویسندگان

  • Mitsuru Nakai
  • Hiroshi Shimodaira
چکیده

This paper highlights on a method that provides a new prosodic feature called ‘ reliability field’ based on a reliability function of the fundamental frequency ( ). The proposed method does not employ any correction process for estimation errors that occur during automatic extraction. By applying this feature as a score function for prosodic analyses like prosodic structure estimation or superpositional modeling of prosodic commands, these prosodic information could be acquired with higher accuracy. The feature has been applied to ‘ template matching method’, which detects accent phrase boundaries in Japanese continuous speech. The experimental results show that compared to the conventional contour, the proposed feature overcomes the harmful influence caused by errors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Acoustic Study of Emotivity-Prosody Interface in Persian Speech Using the Tilt Model

This paper aims to explore some acoustic properties (i.e. duration and pitch amplitude of speech) associated with three different emotions: anger, sadness and joy against neutrality as a reference point, all being intentionally expressed by six Persian speakers. The primary purpose of this study is to find out if there is any correspondence between the given emotions and prosody patterning in P...

متن کامل

Analysis by synthesis of speech prosody: the Prozed environment

This paper presents ProZed, an environment for the multilingual analysis by synthesis of speech prosody. The analysis is based on the symbolic representation of prosodic form without reference to prosodic function. The parameters of the model are at present limited to fundamental frequency and duration but the same framework could be extended to accomodate other parameters such as spectral tilt...

متن کامل

The use of F0 reliability function for prosodic command analysis on F0 contour generation model

This paper describes a method of utilizing an “F0 Reliability Field” (FRF), which we have proposed in our previous work, for estimating prosodic commands on F0 contour generation model. This FRF is the time-frequency representation of F0 likelihood, and an advantage of FRF is that it is not necessary to consider F0 errors that occur during an automatic F0 determination. Therefore, it is thought...

متن کامل

A general-purpose 32 ms prosodic vector for hidden Markov modeling

Prosody plays a central role in conversation, making it important for speech technologies to model. Unfortunately, the application of standard modeling techniques to the acoustics of prosody has been hindered by difficulties in modeling intonation. In this work, we explore the suitability of the recently introduced fundamental frequency variation (FFV) spectrum as a candidate general representa...

متن کامل

Toward invariant functional representations of variable surface fundamental frequency contours: Synthesizing speech melody via model-based stochastic learning

Variability has been one of the major challenges for both theoretical understanding and computer synthesis of speech prosody. In this paper we show that economical representation of variability is the key to effective modeling of prosody. Specifically, we report the development of PENTAtrainer — A trainable yet deterministic prosody synthesizer based on an articulatory-functional view of speech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997